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THE  FILTERING  PROBLEM  FOR  INFINITE  DIMENSIONAL  STOCHASTIC  PROCESSES 


G.  Kallianpur 

University  of  North  Carolina,  Chapel  Hill 

and 

R.L.  Karandikar 

Indian  Statistical  Institute,  New  Delhi 


Abstract 

The  paper  presents  some  recently  obtained  results  on  the  nonline¬ 
ar  filtering  problem  for  infinite  dimensional  processes.  The  optimal 
filter  is  obtained  as  the  unique  solution  of  certain  measure  valued 
equations.  Robustness  properties  -  both  pathwise  and  statistical  - 
are  given  and  a  preliminary  result  shows  consistency  with  the  sto¬ 
chastic  calculus  theory.  Applications  to  random  fields  and  models  of 
voltage  potential  in  neurophysiology  are  briefly  discussed. 


1.  Introduction 

The  finitely  additive  white  noise  approach  to  filtering,  smoothing 
and  prediction  developed  in  Kallianpur  and  Karandikar  [3,4]  is  partic¬ 
ularly  useful  when  the  stochastic  processes  involved  take  values  in  in¬ 
finite  dimensional  spaces.  Since  there  is  no  natural  measure  (such  as 
Lebesgue  measure)  in  infinite  dimensional  spaces,  the  optimal  filter 
is  given  by  a  measure  valued  differential  e  ation  in  which  the  ob¬ 
served  process  occurs  as  a  parameter.  The  existence  and  uniqueness  of 
solution  has  been  established  in  [3]  for  the  filtering  problem  to  which 
we  confine  ourselves  in  this  paper.  However,  the  result  in  [3]  is  not 
applicable  to  the  most  general  observation  model  and,  in  particular, 
does  not  cover  linear  filtering.  We  will  first  state  a  version  of  the 
filtering  result  that  is  a  significant  improvement  over  the  theorems 
obtained  in  [3]  and  then  show  how  it  can  be  applied  to  the  case  when 
the  system  process  is  a  Banach  or  Hilbert  space  valued  Markov  process, 


e.g.,  an  Ornstein-Uhlenbeck  process.  Certain  problems  involving  ran¬ 
dom  fields  (e.g.  horizontal  or  vertical  filtering)  can  also  be  solved 
using  the  white  noise  formulation.  We  conclude  by  discussing  the  ro¬ 
bustness  of  the  measure  valued  filter  including  a  property  which  we 
call  statistical  robustness  (Theorem  3).  Our  final  result  (Theorem  4) 
extends  some  of  our  consistency  results  to  the  measure  valued  case. 

Corresponding  results  for  prediction  and  smoothing  problems  will 
appear  in  our  forthcoming  work  [4]. 

For  notation  and  definitions  not  given  in  this  paper,  the  reader 
is  referred  to  [3]. 

2.  Measure-valued  Equations  for  the  Optimal  Filter 

Let  X  =  ( X^_ )  be  a  Markov  process  defined  on  a  complete  probability 
space  and  taking  values  in  a  Polish  space  S.  Let  K  be  a  sepa¬ 

rable  Hilbert  space  with  inner  product  (  ,  and  norm  1 1  •  !|  ^  and  let 

T 

H:=L2([0,T],K)  :  =  {  ?:  (  0  ,  T]  -*•  K  with  |  |  $  |J  2  :=  /||  ♦>  ||  Jds  <  »}. 

0 

Let  h  :  [0,T]  -  S  * K  be  a  measurable  function  such  that 

T 

/i:h  (Xa  (*))•',  2ds  <  -  (1) 

0  S  S  K 

for  every  ■ 

Denote  by  (H,C,m)  the  finitely  additive  Gaussian  white  noise  meas¬ 
ure  and  by  e,  the  identity  map  from  H  to  H.  Let  ^  (uj)  be  the  A/„  - 

•  o  (H) 


measurable,  H-valued  process  =  h  (X^  (*)  )  ,  t  <-  [0,T].  Both  and  e 

can  be  defined  in  an  obvious  manner  on  (E,f,a)  =  (:1 ,  A  ,  n  )  *  (H ,  C  ,m)  .  Then 
the  abstract  filtering  model 

y  =  +  e  (2) 

takes  the  more  concrete  form 

yt  =  ht  (V  +  et'  0  *  t  '  T-  (3) 


In  (3)  (y  )  is  the  observed  process  and  (e  )  is  K-valued  white  noise. 


Although  our  results  also  include  the  case  of  finite  dimensional  K,  as 
mentioned  earlier,  we  are  concerned  in  this  paper  with  applications 
where  dimK  =  =°. 

Letting  Q  be  the  orthogonal  projection  on  H  with  range 

m  O 

H  :  =  {  v  •  H  :  /  j  j  3  !|  ^ds  =  0 }  ( 0  t  t  ^  T)  ,  it  can  be  shown  that  the  condi- 

t  t  1  S  K 

tional  expectation  (g  (X^ )  j Qty]  exists  in  the  finitely  additive  theo¬ 
ry  and  is  given  by  the  Bayes  formula 

^ (g/Q^Y) 

E.,lg(V!V  -  SVd.Q.y)  141 

wt '  re 

t  ,t  2 

^(g/Q.y)  ^  /g(X  (u))exp[/(y  ,h  (X  (oj)))^ds  -  ±/]|h  (X  (w) )  ||  Lds}I!  (dw) .  (5) 

tt  o  t  0  s  s  s  K  Zq  s  s  k 

It  is  assumed,  of  course,  that  g  :  S  ■+  IR  is  such  that  g  ( X  )  is  "-inte- 

grable . 

The  main  problem  is  to  determine,  recursively,  the  conditional  dis 
tribution  ( • )  of  Xfc  given  y  ,  0  <  s  -  t  or,  equivalently,  the  unnor- 

y 

malized  conditional  distribution  where 


v  ‘tlB'  r  v 

F;  (B)  =  — -  and  3  (g,Q  v)  =  /g(x)FMdx). 

r*(s)  r  s 

Let  M  :=  M(S,S(S)  be  the  class  of  all  finite  Borel  measures  on  S. 
Before  stating  our  result  we  need  to  impose  the  following  conditions 
on  X:  Let  ID  :=  D([0,»),S)  be  the  Skorokhod  space  of  functions  from 
[ 0 , ^ )  to  S  which  are  right  continuous  and  have  left  limits  and  let 
4^  :=  :'xu'  s  •  u  •  t}  where  0  •  s  t  < »  and  (x  )  are  the  coordinate  maps 
(A) (i)  The  paths  of  the  process  X  belong  to  ID; 

(ii)  The  Markov  process  X  admits  a  transition  probability  function 
P (••;••) ; 

(lii)  For  all  (s,x)  •  [ 0 , 1 )  •  S,  there  exists  a  (countably  additive) 

<5 

probability  measure  P  on  (10,4")  such  that  for  all  k  1, 

s ,  x 


0  s  -  tj_  <  .  .  .  <  and  ,  •  .  .  ,  .  8  (S )  we  have  (with  tg  =  s , 

y0=x). 

k 

Ps,x(xt.  '  Ai'  1-i"kl  =  /•  •  •  /  ni1A.  (y  i  >  p  (ti-i'yi-i;tk'dyi)  * 

The  extended  generator  of  X  will  be  denoted  by  L  and  its  domain  by  V. 
More  precisely,  L  is  the  generator  of  Xfc  :  =  (t,Xfc)  with  augmented  state 
space  [0,»)  -S  and  stationary  transition  probabilities.  Let  (v®)  be 
the  two  parameter  semigroup  of  ( X^_ )  and  (T  )  ,  the  one  parameter  semi¬ 
group  of  (X  )  with  extended  generator  L  and  domain  V.  See  [3]. 

Theorem  1.  Let  the  S-valued  Markov  process  X  satisfy  Condition  (A) 


El  1  h  (x  )  ':  "ds 

k  S  S  K 


For  each  ^  •  H  let  c  '  ( x)  :  [0,T]  *  S  IR  be  defined  by 

s 


c J  (x)  =  (h„  (x)  ,  n  )  ^  -  il! 


Denote  bv  V' 


Vk  -  !l'hs(x)!'r  (7) 

the  semigroup  corresponding  to  the  transition  pro¬ 


babilities  of  X  and  also  write  Ng  :=  HoXg  •  Then  the  following  conclu¬ 
sions  hold  for  all  y  •  H: 

„y  . 

(a)  ■  i  ls  the  unique  solution  of  the  integral  equation 


(t,  •  )  ,7V  ■  =  <  f  (0  ,  ’  )  ,  Nn  >  +  /<(Lf+c^f)(s,*),r-1'>ds,  f  •  V 


(b)  7*  is  the  unique  solution  of  the  equation 

D  t 

"g'"t  '  =  <Vtg,JV  +  ^Cs(V^g)  (O.r^ds 

for  all  g  ■  IB(S) ,  the  bounded,  Borel  measurable  functions  on  S. 

(c)  is  the  unique  solution  to 


. -v-v 


**  *-,  "-E  -»  ~-T  -U*  S 


v  .■  *>  v  y 


(10) 


;  f  ( t ,  • )  /  >  =  <  f  ( 0  ,  •  )  ,  Nq>  +  /<  (Lf  +  c^f)  (s,  •  )  ,Fg>ds 


-  /<cy  (F^xf  ,F^>ds,  f  V , 
0  s  s  s 


y 

(d)  FJ  is  the  unique  solution  of  the  equation 


t  t 

q,Ft"  =  <Vtg'N0>  +  /<Cs(Vtg)'Fs>ds  "  /<Cs'Fg><Vtg'Fs>ds  (11) 


for  all  g  c  IB  (S)  . 

The  uniqueness  property  asserted  above  is  to  be  understood  to  hold 
in  the  class  of  {K  }  £  M  satisfying  the  following  conditions: 


For  A  £  B  ( S )  ,  K  (A)  =E71A(XQ)  and  t-K  (A)  is  a  bounded  Borel 


measurable  function; 


(12) 


-1 


Kfc  is  absolutely  continuous  with  respect  to  HoXt  and 


dK . 


dFoX 


-1 


<  R,  0  <  t  <  T, 


(13) 


t. 


for  a  suitable  constant  R  <  ». 


3.  Examples 

The  theorems  of  the  preceding  section  can  be  applied 
problems  of  filtering  of  random  fields.  Suppose  that  X  = 
0  <s  -  T,  0  x  r  b  is  a  two  parameter,  real-valued,  sample 
process.  We  shall  refer  to  x  as  the  spatial  parameter, 
tion  model  is 
tx  . 


y 


tx 


ff 

J  J 
00 


h  (X 
s  sv 


dvds  +  W. 

tx 


to  certain 
(X  )  , 
continuous 
The  observa- 


(14) 


where  the  following  conditions  are  imposed: 

(i)  W=  (W  )  is  a  standard,  Yeh-Wiener  process. 

(ii)  X  and  W  are  independent. 

(iii)  IE  /q/q  [^s  (XSx)  l2dxds  <  «. 

The  assumptions  on  X  allow  us  to  regard  X  as  a  sample  continuous  pro- 


So  we  have 


cess  taking  values  m  the  Banach  space  B  :=  Cq ( [ 0 , b] ,  IR)  . 

S=B  and  Xt  e  C  ( [0  ,T]  ;  B)  .  We  shall  make  the  further  assumption  that  X 
is  a  (B-valued)  Markov  process.  Consider  now  the  white  noise  model. 

Take  K  ~H,  the  (reproducing  kernel)  Hilbert  space  of  real  valued  abso¬ 
lutely  continuous  functions  on  [0,b]  with  square  integrable  derivatives. 
Let  h  :  [0  ,T]  *  B  H  be  given  by 

h  (n) (x)  :=  /h  (n  )dv,  n  e B. 
r  0 

Then  the  white  noise  observation  model  is  given  by 

yt  =  ht(Xt)  +  et'  0  -  t  -T'  (15) 

where  e  is  H-valued  Gaussian  white  noise.  For  more  details  and  a 
somewhat  different  treatment  of  this  problem,  see  [7]. 

The  conversion  of  the  filtering  problem  involving  a  two  parameter 
random  field  into  a  filtering  problem  for  an  infinite  dimensional  (in 
this  case  B-valued)  signal  process  is  useful  in  cases  when  observations 
can  be  assumed  to  be  available  for  all  values  of  the  spatial  variable 
x.  The  latter  situation  occurs  when  X  represents  the  (random)  volt- 

LX 

age  potential  at  time  t  and  "site"  x  of  a  spatially  extended  neuron, 

the  neuron  being  modeled  as  a  thin  cylinder  or  segment  [0,bj  (see 

[5,6]).  Another  example  in  which  neither  parameter  has  the  connotation 

of  time  occurs  in  problems  of  physical  geodesy  [1]. 

An  example  of  a  B-valued  Markov  process  X  is  the  Malliavin  Orn- 

stein  Uhlenbeck  (O-U)  process  which  is  given  by  the  unique  solution  of  the 

SDE  dfcXtx  =  -  jX^dt  +dBfcx,  Xqx--  B  (X ^  Gaussian)  where  B  =  (B  )  is  a 

Yeh-Cameron  Wiener  process.  The  generator  L  (the  so-called  Ornstein- 

Uhlenbeck  operator)  is  well  known:  For  0  < x, , . . . ,  x  <b  and  for  func- 

1  n 

tions  f  :  B  -*■  IR  of  the  form 

f (n)  =  f(n(x  ),...,  n ( x  )) 

i  n 

^  2 

where  f  f  C,  (IRn)  we  have 
b 


vyv- .VV/vVv  v  v  v  v  v,  .•v.v.v.--,  /.  • .  y-/ 


(Lf)  (n) 


2 


l 

i/  j  =  l 


(X . Ax . ) 
1  ] 


d  £ 


3x . 3x . 

1  3 


(n  (x^) 


/  *  • 


1 

2 


n 


I  n(x.) 

i=l  1 


3f 
3x . 

l 


(n  (x^)  ,  •  •  •  , 


,  n(xn)) 
n (xn) )  . 


Other  examples  in  which  the  infinite  dimensional  signal  process  X 
does  not  arise  from  a  multiparameter  stochastic  process  also  occurs  in 
the  study  of  neuronal  behavior  when  the  geometrical  shape  of  the  neuron 
is  more  complicated.  Even  in  the  simplest  such  models,  X  is  an  0-U 
process  (more  generally,  a  non-Gaussian ,  diffusion  process)  whose  sample 
paths  lie  in  C([0,T],K)  where  K  is  some  infinite  dimensional  Hilbert 
space.  A  natural  filtering  problem  is  given  by  the  linear  model 


=  Xt  +  et 


which  follows  from  (3)  by  taking  B  =  K  and  h  (n)  =n- 


4.  Robustness  of  the  Measure  Valued  Optimal  Filter 

We  shall  now  state  some  typical  results  on  the  robustness  properties 
of  the  measure  valued  optimal  filter.  These  are  extensions  of  results 
recently  obtained  by  H.P.  Hucke  in  his  thesis  [2],  Robustness  is  used 
in  two  senses  here:  the  first  in  the  sense  that  has  become  convention¬ 
al  in  the  theory,  viz.,  the  continuous  pathwise  dependence  of  the  opti¬ 
mal  filter  on  the  observations;  the  second  is  robustness  as  commonly 
used  in  statistics. 

Write  M  :=  IE/Jjjh  (X  )  ||  Jds  <  •*>. 

U  S  S 

Theorem  2.  Let  y,y'  ■  H  and  denote  the  total  variation  norm 


-  i  _  r  /  i 

‘  t  '  t  1 Var 


sup  j  (A)  -  rY'  (a)  ; . 

A-:  6  ( S )  t 


Then 


-y  _  -Y '  ii 

'  t  t  11  Var 


M-  I JQ  y  -  Q  y ' 


Theorem  3.  Let  (Xk)  (k=l,2,...) 


•exp(i;|Qty  j’  2  +  J I  i  Qty '  !'  2  )  . 

be  signal  processes  satisfying  assump- 


tion  (A)  of  Theorem  1.  Suppose  further  that  the  following  conditions 
are  satisfied: 

(a)  h  :  [ 0  , T ]  *S  -*■  K  is  continuous. 

k  -1 

(b)  The  sequence  of  measures  n°  (X  )  converges  weakly  (---=>)  to 
y.ox'1  in  (ID,  6(D)  )  . 

(c)  HoX  ■*"  (  [x  e  ID  :  x  ^  x^_]  ^  =  0  for  each  t . 

Then  for  every  y  eH,  =>  in  M(S)  (in  the  topology  of  weak  con¬ 

vergence  .  ) 

The  statistical  significance  of  the  above  result  is  clear.  It  says 
that  small  changes  in  the  distribution  of  the  signal  process  cause  only 
small  deviations  in  the  optimal  filter.  Further  implications  of  stat¬ 
istical  robustness  in  specific  problems  (such  as  the  Kalman  filter  with 
non-Gaussian  initial  distribution)  will  be  discussed  in  a  later  work. 

To  what  extent  do  the  results  of  this  paper  imply  analogous  re¬ 
sults  for  the  stochastic  calculus  theory?  Such  consistency  results 
when  signal  and  noise  are  finite  dimensional  have  been  presented  in 
[3].  For  the  infinite  dimensional  case,  results  at  this  level  of 
generality  do  not  seem  to  have  been  obtained  in  the  conventional  theory 
Nevertheless,  using  the  white  noise  model  of  this  paper  one  might  hope 
to  derive  robust  versions  for  the  countably  additive  model 
of  at  least  some  of  our  results.  We  conclude  this  topic  by  stating  a 
preliminary  result  of  this  kind. 

Let  (v,K,B)  be  an  abstract  Wiener  space,  y  :K«u.B,  X  :=  Cg([0,T],B) 

2 

and  ,  the  Wiener  measure  on  (X,S(X)).  Recalling  that  H  =  L  ([0,T],K), 
a  representation  space  for  (H,C,m)  is  given  by  (X,8(X),y).  Denote  by 
W  ,  the  coordinate  map  on  X.  Letting  (?i.  A, 6)  =  (ft ,  A ,  n )  ®  ( X ,  8  (X)  ,  y )  and 
defining  all  processes  involved  on  the  product  space  2  in  the  usual 
manner  we  obtain  a  stochastic  calculus  model  corresponding  to  (15) 


~Y  ~  Y 

Finally  let  T  ,  F^_  be  the  conditional  measures  for  the  optimal  filter 

for  (16).  Let  0#  stand  for  either  one  of  these.  Similarly  let  •/  be 

v  y 

either  one  of  F^  or  F^_. 

Theorem  4. 

(a)  Let  g  ?  i^(i!,A,n)  where  g(w)  =  f(Xt(u>)).  Then,  with  denoting 
the  lifting  map  we  have 

Rci[Ea(g'Qty)  1  =  3En<9lFt)* 

(b)  Let  f  :  S  -*•  IR  be  bounded  and  continuous.  Then 
R  [<f,0Y>]  =  <  f , 0Y> 

where  <f , v>  :=  Jf (x) dv (x)  for  u  eM(S). 
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